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CAROTENOID KETOLASE GENES AND GENE PRODUCTS, 
PRODUCTION OF KETOCAROTENOIDS AND METHODS OF 
MODIFYING CAROTENOIDS USING THE GENES 

BACKGROUND OF THF INVENTION 
5 Carotenolds are widely distributed natural pigments that are responsible for 

nnany of the yellow, orange and red colors seen in living organisms. They have 
important commercial uses as coloring agents in the food industry, as feed and food 
additives, in cosmetics and as provitamin A precursors. 

The plant species Adonis aestivalis produces flowers with petals that are deep 
10 red in color and nearly black at the base of the petals due to the accumulation of 
ketocarotenoid and other carotenoid pigments (Neamtu et aL, Rev. Roum, Biochim, 
6:157, 1969). This pattern of carotenoid accumulation accounts for the common name 
of some varieties of this species: summer pheasant's eye. 

Among the carotenolds identified in the petals of the red petal varieties of these 
15 various species is the ketocarotenoid astaxanthin (3.3'-dihydroxy-4,4'-diketo-b,b- 
carotene; see Figure 1). Various other ketocarotenoids (see Figure 1) including 3- 
hydroxyechinenone ( 3-hydroxy-4-keto-b,b-carotene), adonirubin (3-hydroxy-4.4-diketo- 
b.b-carotene) adonixanthin (3,3'-dihydroxy-4-keto-b,b-carotene) and isozeaxanthin 
(4,4'-dihydroxy-b,b-carotene; see T.W. Goodwin, The Biochemistry of the Carotenoids, 
20 vol I. Plants, 2nd edition, 1980, page 147) have also been reported. The latter 
compound is consistent with speculation that the 4-hydrbxy may be an intermediate In 
the formation of the 4-keto group. 

SUMMARY OF THF INVRMTIOM 

There is appreciable interest In the biological production of carotenoids. In 
25 particular the orange-colored ketocarotenoids such as astaxanthin and canthaxanthin 
(Figure 1), and in the modification of carotenoid composition. For this reason, an A. 
aestivalis flower cDNA library was constructed and screened for cDNAs encoding 
enzymes (hereinafter referred to as "ketolases" although the specific biochemical 
activity has not yet been established) involved In the conversion of b-carotene into 
30 orange compounds with absorption properties similar to those exhibited by common 
ketocarotenoids such as canthaxanthin (Figure 1). Two distinctly different Adonis 
aestivalis cDNAs were obtained from among a number of cDNAs that were selected on 
this basis. 
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Thus, a first aspect of the present invention is a purified nucleic acid sequence 
which encodes for a protein having ketolase enzyme activity and has the nucleic acid 
sequence of SEQ ID NO: 1 or 3. 

The invention also includes a purified nucleic acid sequence which encodes for 
5 a protein having ketolase enzyme activity and having the amino acid sequence of SEQ 
ID NO: 2 or 4. 

The invention also includes vectors which comprise any portion of the nucleic 
acid sequences listed above, and host cells transformed with such vectors. 

Another aspect of the present invention is a method of producing a 
10 ketocarotenoid in a host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having ketolase enzyme activity and comprises 
(1) SEQ ID NO: 1 or 3 or (2) a sequence which encodes the amino acid sequence of 
SEQ ID NO: 2 or 4, wherein the heterologous nucleic acid sequence is operably linked 
15 to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing 
the ketolase enzyme. 

Another subject of the present invention is a method of modifying the production 
of carotenoids in a host cell, relative to an untransformed host cell, the method 
20 comprising 

inserting into a host cell which already produces carotenoids a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein having 
ketolase enzyme activity and comprises (1 ) SEQ ID NO: 1 or 3 or (2) a sequence which 
encodes the amino acid sequence of SEQ ID NO: 2 or 4, wherein the heterologous 
25 nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence in the host cell to 
modify the production of the carotenoids in the host celt, relative to an untransformed 
host ceil. 

BRIEF DESCRIPTION QF THE DRAWINGS 
30 A more complete appreciation of the invention and many of the attendant 

advantages thereof will be readily obtained as the same becomes better understood by 
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reference to the following detailed description when considered in connection with the 
accompanying drawings. 

Figure 1 illustrates structures and biochemical routes leading from b-carotene to various 
of the ketocarotenoids referred to in the text. Conversion of p-carotene to astaxahthin 
5 by a hydroxylase enzyme (Hy) and a ketolase enzyme (keto) could proceed via any one 
or all of several possible routes depending on the order of the reactions. 

Figure 2 illustrates the beta ring structure of b-carotene and various modifications of this 
parent ring that might be produced through the action of the products of the A. aestivalis 
ketolase cDNAs. Also shown is the structure of the epsilon ring, not found to be a 
10 substrate for the A. aestivalis ketolases and present in carotenoids such as d-carotene, 
e-carotene, a-carotene and lutein. 

Figure 3 illustrate results obtained with TLC (thin layer chromatography) separation of 
carotenoid pigments extracted from E. coli cultures, previously engineered to produce 
b-carotene, but that now also contain the A. aestivalis ketolase cDNAs and/or other 

15 introduced genes and cDNAs. The Figure indicates the empty plasmid vector 
pBluescript SK- (SK-). the Adonis aestivalis ketolase 1 cDNA in this plasmid vector (Ad 
ketol), the Haematococcus pluvialis ketolase cDNA in this plasmid vector Hp keto), or 
the Arabidopsis p-carotene hydroxylase cDNA (At Ohase). Bands that were orange in 
color are shown here with a darker fill than those with a yellow color. Identities of 

20 various bands are indicated to the right of the band. 

Figure 4 illustrates the absorption spectrum of one of the orange carotenoids produced 
from b-carotene via the action of the Adonis ketolases and makes clear the similarity 
of the spectrum to that of canthaxanthin. Absorption spectra (in acetone) of p-carotene, 
canthaxanthin and an unknown orange product (orange band #1 ; the lower orange 
25 band in the first lane of Figure 3) extracted from cultures after introduction of the Adonis 
aestivalis ketol cDNA (SEQ ID NO: 1 ) in cells of E. coli that othenvise produce and 
accumulate p-carotene. The absorption spectrum of the unknown resembles that of 
canthaxanthin but the compound migrates to a position below echinenone on RP18 
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TLC plates developed with a mobile phase of methanol :acetone (1:1 by volume). The 
absorption spectrum of orange band #2 also is similar to that of canthaxanthin but it 
migrates more rapidly than canthaxanthin indicating that it is probably a more polar 
compound. 

5 Figure 5 shows SEQ ID NO: 5 (the sequence shown In this Figure includes SEQ ID NO: 

1 and also includes some of the flanking DNA from the adapator DNA and the multiple 
cloning site (MCS) of the library cloning vector, which sequences are shown in bold). 

Figure 6 shows SEQ ID NO: 6 (the sequence shown in this Figure includes SEQ ID NO: 

2 and also includes a translation of amino acids resulting from the adapator DNA and 
10 the multiple cloning site (MCS) of the library cloning vector and the start codon from the 

plasmid vector pTrcHis, which sequences are shown in bold and capitalized). 

Figure 7 shows SEQ ID NO: 7 (the sequence shown in this Figure includes SEQ ID NO: 

3 and also includes some of the flanking DNA from the adapator DNA and the multiple 
cloning site (MCS) of the library cloning vector, which sequences are shown in bold). 

15 Figure 8 shows SEQ ID NO: 8 (the sequence shown in this Figure includes SEQ ID NO: 

4 and also includes a translation of amino acids resulting from the adapator DNA and 
the multiple cloning site (MCS) of the library cloning vector and the start codon from the 
plasmid vector, which sequences are shown in bold and capitalized). 

Figure 9 shows a "Gap" alignment of the two Adonis ketolase sequences of the 
20 invention. A truncated version of SEQ ID NO: 1 is shown in this Figure for comparitive 
purposes, and is designated SEQ ID NO: 9. The percentage identity was calculated 
to be 91.107. 

Figure 10 shows a "Gap" alignment of SEQ ID NO: 2 and 4, The following results were 
found: 

25 Gap weight: 12 average match: 2,912 



Length weight: 4 



average mismatch: -2.003 
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Quality: 1440 length: 307 

Ratio: 4.691 gaps: 0 

percent similarity: 92.182 percent identity: 90.228 

Figure 1 1 shows a comparison between SEQ ID NO: 2 and the Arabidopsis thaliana P- 
5 carotene hydroxylase enzyme (GenBank U58919) (SEQ ID NO: 10). 

Figure 12A shows gDNA (SEQ ID NO: 11) immediately upstream of the cDNA of SEQ 
ID NO: 3. The sequence was obtained from a PGR product generated using the 
GenomeWalker kit of Clontech Laboratories, Inc. (1020 East Meadow Circle, Palo Alto, 
CA 94303-4230) and nested primers specific to the ketolases of Adonis aestivalis 
10 (cagaatcggtctgttctattagttcttcc (SEQ ID NO: 17) and caatttgaggaatatcaaggttccttgttctc 
(SEQ ID NO: 18)). The termination codon upstream of and in-frame with initiation 
codon (TAA at positions 204-206) is shown in bold. Initiation codon (ATG) is also 
shown in bold. 

Figure 12B (SEQ ID NO: 12) indicates that the full length polypeptide of SEQ ID NO: 
15 4 begins with the amino acids MAA (shown in bold) immediately preceding the ketolase 
sequence shown in Figure 8. A similar MAA amino acid sequence immediately 
preceding SEQ ID NO: 1 is also expected. 

Figure 13 shows an alignment of SEQ ID NO: 2, SEQ ID NO: 12, an Arabidopsis P- 
20 carotene hydroxylase enzyme (predicted product of GenBank U58919) (SEQ ID NO: 
13), a putative second Arabidopsis hydroxylase predicted by genomic DNA sequence 
(GenBank AB025606; the exon/intron junctions were chosen with reference to the 
product of the Arabidopsis P-carotene hydroxylase cDNA u58919) (SEQ ID NO: 14). 
and two Capsicum annuum P-carotene hydroxylases (predicted products of GenBank 
25 Y09722 and Y09225) (SEQ ID NO: 1 5 and 1 6). 

DESCRIPTION OF THF PRFFFRRED EMBODIMENTS 
The present invention is directed to a purified nucleic acid sequence which 
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encodes for a protein having ketolase enzyme activity and has the nucleic acid 
sequence of SEQ ID NO: 1 or 3. 

The invention also includes a purified nucleic acid sequence which encodes for 
a protein having ketolase enzyme activity and having the amino acid sequence of SEQ 
5 ID NO: 2 or 4. 

Two different but closely-related nucleic acids have been isolated. The 
sequences of the longest example of each are presented herein. Sequencing which 
has subsequently been conducted of upstream genomic DNA indicates that SEQ ID 
NO: 3 lacks bases encoding the first three amino acids (MAA; see Figure 12). Likely, 

10 this is also the case for SEQ ID NO: 1 , but the upstream genomic sequences have not 
yet been obtained for this nucleic acid. 

The two different Adonis ketolases denoted in SEQ ID NO: 1 and 3 are similar 
in sequence, sharing about 91 % identity, as determined by the Gap program discussed 
below (see Figure 9). The predicted amino acid sequences of the enzymes denoted in 

15 SEQ ID NO: 2 and 4 share about 92% similarity and about 90% identity, also as 
determined by the Gap program (see Figure 10). 

Therefore, it is clear that certain modifications of SEQ ID NO: 1 or 3 or SEQ ID 
NO: 2 or 4 can take place without destroying the activity of the enzyme. Note also that 
certain truncated versions of the cDNAs of SEQ ID NO: 1 or 3 were found to be 

20 functional (i.e., these cDNAs retained the property of causing the conversion of b- 
carotene to orange compounds). Also, the Arabidopsls (3-carotene hydroxylase 
(GenBank U58919), aligned with the ketolase SEQ ID NO: 2 in Figure 11, retains 
catalytic function when truncated to yield a polypeptide that lacks the first 129 amino 
acids (Sun et al., 1996). From the alignment in Figure 1 1 , therefore, this would suggest 

25 that the two ketolases of the invention retain catalytic activity after truncation to remove 
bases encoding the first 132 amino acids. 

Thus, the present invention is intended to include those ketolase nucleic acid 
and amino acid sequences in which substitutions, deletions, additions or other 
modifications have taken place, as compared to SEQ ID NO: 1 or 3 or SEQ ID NO: 2 

30 or 4, without destroying the activity of the ketolase enzyme. Preferably, the 
substitutions, deletions, additions or other modifications take place at those positions 
which already show dissimilarity between the present sequences. For SEQ ID NO: 1 , 
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as shown in Figure 9. these positions are as follows: positions 7, 20, 23, 35, 53. 63, 65. 
67, 76, 78. 85, 86. 91. 107. 109-111. 135. 140. 144. 146. 160. 168, 217, 219. 241. 249, 
254, 256, 271, 291, 296. 349, 389. 400. 406. 431, 448. 449. 460. 471. 499, 530. 589. 
619, 643, 653, 654, 667, 679, 709, 731, 742. 784. 787, 836. 871, 883. 896, 911. 919, 
5 928. 930. 939. 943. 967, 969, 978, 979, 982, 988, 995, 1005, 1006, 1012-1014, 1017, 
1019-1021, 1023, 1025, 1049, 1050, 1054, 1060-1068, 1070-1073, 1075, 1094, 1100, 
1101. 1106, 1107. 1109 and 1111-1176. For SEQ ID NO: 3. as shown in Figure 9, 
these positions are as follows: positions 7. 20, 23, 35. 53. 63. 65. 67. 76, 78. 85, 86, 91 . 
107. 109-111. 135. 140. 144, 146, 160, 168, 217, 219. 241, 249. 254, 256. 271. 291, 
10 296. 349. 389. 400, 406. 431 , 448, 449, 460. 471 , 499, 530, 589, 619, 643. 653. 654. 
667. 679. 709, 731, 742. 784, 787. 836, 871. 883, 896, 91 1, .919. 928. 930. 939. 943. 
966, 967. 970. 979, 980. 983, 989, 996, 1006, 1007, 1013-1015, 1018, 1020-1022, 
1024. 1026, 1050, 1051, 1055, 1062-1065, 1067, 1086, 1092. 1093, 1098, 1099, 1.101 
and 1103-1112. 

15 For SEQ ID NO: 2 and 4, as shown in Figure 10, the following amino acids can 

be substituted or deleted, or additions or other modifications can be made, without 
destroying the activity of the ketolase enzyme: positions 7, 8, 12, 18, 21, 22, 25, 26, 36, 
37, 45, 47-49, 56, 73, 83, 85, 97, 99, 130, 144, 150, 157. 166, 218, 244, 279, 299 and 
304. Therefore, the present invention also Intends to cover amino add sequences 

20 where such changes have been made. 

In each case, nucleic acid and amino acid sequence similarity and identity Is 
measured using sequence analysis software, for example, the Sequence Analysis, Gap, 
or BestFIt software packages of the Genetics Computer Group (University of Wisconsin 
Biotechnology Center, 1710 University Avenue, Madison, Wisconsin 53705), MEGAIign 

25 (DNAStar, Inc., 1228 S. Park St., Madison, Wisconsin 53715), or MacVector (Oxford 
Molecular Group. 2105 S. Bascom Avenue, Suite 200, Campbell, Callfomla 95008). 
Such software uses algorithms to match similar sequences by assigning degrees of 
identity to various substitutions, deletions, and other modifications, and Includes 
detailed instructions as to useful parameters, etc., such that those of routine skill in the 

30 art can easily compare sequence similarities and Identities. An example of a useful 
algorithm in this regard is the algorithm of Needleman and Wunsch, which is used In the 
Gap program discussed above. This program finds the alignment of two complete 
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sequences that maximizes the number of matches and minimizes the number of gaps. 
Another useful algorithm is the algorithm of Smith and Waterman, which is used in the 
BestFit program discussed above. This program creates an optimal alignment of the 
best segment of similarity between two sequences. Optimal alignments are found by 
5 inserting gaps to maximize the number of matches using the local homology algorithm 
of Smith and Waterman. 

Conservative (i.e. similar) substitutions typically include substitutions within the 
following groups: glycine and alanine; valine, Isoleuclne and leucine; aspartic acid, 
glutamic acid, asparagine and glutamine; serine and threonine; lysine and arginine; and 

1 0 phenylalanine and tyrosine. Substitutions may also be made on the basis of conserved 
hydrophobicity or hydrophilicity (see Kyte and Doolittle, J. Mol. Biol. 157: 105-132 
(1982)), or on the basis of the ability to assume similar polypeptide secondary structure 
(see Chou and Fasman, Adv, Enzymol. 47: 45-148 (1978)). 

If comparison is made between nucleotide sequences, preferably the length of 

15 comparison sequences is at least 50 nucleotides, more preferably at least 60 
nucleotides, at least 75 nucleotides or at least 100 nucleotides. It is most preferred if 
comparison is made between the nucleic acid sequences encoding the enzyme coding 
regions necessary for enzyme activity. If comparison is made between amino acid 
sequences, preferably the length of comparison is at least 20 amino acids, more 

20 preferably at least 30 amino acids, at least 40 amino acids or at least 50 amino acids. 
It is most preferred if comparison is made between the amino acid sequences in the 
enzyme coding regions necessary for enzyme activity. 

While the two different Adonis ketolase enzymes of the present invention are 
similar in sequence, previously-described bacterial (Misawa et al., 1995), cyanobacterial 

25 (Fernandez-Gonzalez et al.,1997), and green algal {Haematococcus pluvialis; Lotan et 
al., 1995; Kajiwara et al., 1995) (3-carotene ketolase enzymes bear little resemblance 
to the Adonis ketolases, although certain histidine motifs and features of the predicted 
secondary structure are common to the polypeptides predicted by both groups 
(Cunningham and Gantt, 1998). 

30 The present invention also includes vectors containing the nucleic acids of the 

invention. Suitable vectors according to the present invention comprise a gene 
encoding a ketolase enzyme as described above, wherein the gene is operably linked 
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to a suitable promoter. Suitable promoters for the vector can be constructed using 
techniques well known in the art (see, for example, Sambrook et al.. Molecular Cloning 
A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor. NY. 1989; 
Ausubel et al., Current Protocols in Molecular Biology , Greene Publishing and Wiley 
5 Interscience. New York, 1991 ). Suitable vectors for eukaryotic expression in plants are 
described in Fray et al., (1995; Plant J. 8:693-701) and Misawa et al. (1994; Plant J, 
6:481-489). Suitable vectors for prokaryotic expression include pACYC184. pUC1 19, 
and pBR322 (available from New England BioLabs, Bevery, MA) and pTrcHis 
(Invitrogen) and pET28 (Novagen) and derivatives thereof. The vectors of the present 

10 invention can additionally contain regulatory elements such as promoters, repressors, 
selectable markers such as antibiotic resistance genes, etc., the construction of which 
is very well known in the art. 

The genes encoding the ketolase enzymes as described above, when cloned 
into a suitable expression vector, can be used to overexpress these enzymes in a host 

15 cell expression system or to inhit)jt the expression of these enzymes. For example, a 
vector containing a gene of the invention may be used to increase the amount of 
ketocarotenoids in an organism and thereby alter the nutritional or commercial value or 
pharmacology of the organism. A vector containing a gene of the invention may also 
be used to modify the carotenoid production in an organism. 

20 Therefore, the present invention includes a method of producing a 

ketocarotenoid in a host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having ketolase enzyme activity and comprises 
(1) SEQ ID NO: 1 or 3 or (2) a sequence which encodes the amino acid sequence of 

25 SEQ ID NO: 2 or 4, wherein the heterologous nucleic acid sequence is operably linked 
to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing 
the ketocarotenoid. 

The present invention also includes a method of modifying the production of 
30 carotenoids in a host cell, relative to an untransformed host cell, the method comprising 
inserting into a host cell which already produces carotenoids a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein having 
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ketolase enzyme activity and comprises (1 ) SEQ ID NO: 1 or 3 or (2) a sequence which 
encodes the amino acid sequence of SEQ ID NO: 2 or 4, wherein the heterologous 
nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence in the host cell to 
5 modify the production of the carotenoids in the host cell, relative to an untransformed 
host cell. 

The term "modifying the production" means that the amount of carotenoids 
produced can be enhanced, reduced, or left the same, as compared to an 
untransformed host cell. In accordance with one embodiment of the present invention, 

10 the make-up of the carotenoids (i.e., the type of carotenoids produced) Is changed vis 
a vis each other, and this change in make-up may result in either a net gain, net loss, 
or no net change in the amount of carotenoids produced in the cell. In accordance with 
another embodiment of the present invention, the production or the biochemical activity 
of the carotenoids (or the enzymes which catalyze their formation) is enhanced by the 

15 insertion of the ketolase enzyme-encoding nucleic acid. In yet another embodiment of 
the invention, the production or the biochemical activity of the carotenoids (or the 
enzymes which catalyze their formation) may be reduced or inhibited by a number of 
different approaches available to those skilled in the art, including but not limited to 
such methodologies or approaches as anti-sense (e.g.. Gray et al. (1992), Plant Mol. 

20 BioL 19:69-87). ribozymes (e.g., Wegener et al (1994) Mol Gen. Genet 1994 Nov 
15;245(4):465-470), co-suppression (e.g. Fray et al. (1993) Plant MoL BioL 
22:589-602), targeted disruption of the gene (e.g., Schaefer et al. Plant J. 
11:11 95-1 206, 1 997), intracellular antibodies (e.g., see Rondon et al. (1 997) Annu. Rev. 
Microbiol. 51:257-283) or whatever other approaches rely on the knowledge or 

25 availability of the nucleic acid sequences of the invention, or the enzymes encoded 
thereby. 

Host systems according to the present invention preferably comprise any 
organism which is capable of producing carotenoids, or which already produces 
carotenoids. Such organisms include plants, algae, certain bacteria, cyanobacteria and 
30 other photosynthetic bacteria. Transformation of these hosts with vectors according to 
the present invention can be done using standard techniques. See, for example, 
Sambrook et al.. Molecular Cloning A Laboratory Manual . Cold Spring Harbor 
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Laboratory, Cold Spring Harbor, NY, 19.89; Ausubel et al.. Current Protonols in 

Molecular Biology, Greene Publishing and Wiley Interscience, New York, 1991. 

Alternatively, transgenic organisms can be constructed which include the nucleic 

acid sequences of the present invention. The incorporation of these sequences can 
5 allow the controlling of carotenoid biosynthesis, content, or composition in the host cell. 

These transgenic systems can be constructed to incorporate sequences which allow 

for the overexpression of the various nucleic acid sequences of the present invention. 

Transgenic systems can also be constructed which allow for the underexpression of the 

various nucleic acid sequences of the present invention. Such systems may contain 
10 anti-sense expression of the nucleic acid sequences of the present invention. Such 

anti-sense expression would result in the accumulation of the substrates of the enzyme 

encoded by the sense strand. 

Having generally described this invention, a further understanding can be 

obtained by reference to certain specific examples which are provided herein for 
15 purposes of illustration only and are not intended to be limiting unless otherwise 

specified. 

EXAMPLE 1 

Isolation of nlant cDNAs that convert b -narotene into compounds with ketocarotenoid- 

llke spectra 

20 A flower cDNA library from the plant Adonis aestivalis was Introduced into a 

strain of Escherichia coli engineered to accumulate the yellow carotenoid pigment Pr 
carotene (see Cunningham et al.. Plant Cell 8:1613-26, 1996). This strain of E. coli 
normally forms yellow colonies when cultures are spread on a solid agar growth 
medium. Ketocarotenoids that are derived from b-carotene, such as echinenone and 

25 canthaxanthin (Figure 1 ), are, in contrast, orange to orange-red in color. Colonies that 
were orange rather than yellow in color were visually selected, and the DNA sequences 
of the Adonis aestivalis cDNAs within the plasmid vectors contained in these colonies 
were ascertained. Two distinct cDNAs were obtained from analysis of cDNA inserts in 
plasmids obtained from approximately 10 selected colonies. The DNA sequences of 

30 these two ketolase cDNAs are presented herein. 

The products produced by the ketolases of the invention which have been 
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expressed in a 3-carotene-accumulating strain of Eschericia coli have not yet been 
identified. As many as 5 or 6 different colored bands, in addition to the substrate P- 
carotene, may readily be discerned by C,^ TLC separation (see Figure 3). To provide 
appropriate standards to assist in identification, an H. pluvialis ketolase and an 
5 Arabidopsis P-carotene hydroxylase were separately introduced into the p-carotene- 
accumulating E. coli to produce echinenone (3-keto-p.p-carotene) and canthaxanthin 
(3.3'-diketo-P,p-carotene) or p-cryptoxanthin (4-hydroxy-P,p-carotene) and zeaxanthin 
{4,4'-dihydroxy-p,p-carotene). None of the compounds formed in the presence of the 
ketolases of the invention (no difference was observed in products formed in the 

10 presence of the two different nucleic acid sequences of the invention) both migrate in 
the TLC system and have the absorption spectrum expected for echinenone, 
canthaxanthin, p-cryptoxanthin, or zeaxanthin. Two of the colored TLC bands produced 
in the presence of the Adonis ketolase cDNAs are orange in color. Orange band #1 
has an absorption spectrum similar to that of canthaxanthin (see Figure 4) but migrates 

15 in a position that indicates a polarity intermediate to echinenone and p-carotene. 
Orange band #2 also has an absorption spectmm like that of canthaxanthin but 
migrates in a position that indicates a polarity intermediate to canthaxanthin and 
zeaxanthin (see Figure 3). The absorption spectra and TLC results suggest that the 
two orange products could be desaturated at the 3-4 positions of both rings (3,4,- 

20 didehydro; see Figure 2). Orange band #1 (see Figure 3) might then be 3,4,3',4'- 
tetradehydro-3,p-carotene. To substantially affect the absorption spectrum of the 
substrate P-carotene, any modifications very likely involve a carbon that lies in 
conjugation with the conjugated chain of carbon-carbon double bonds that constitute 
the chromophore (Goodwin, 1980; The BinrhP mistry of tha Carotenntris , volume I; 2"^ 

25 edition, Chapman and Hall). For the spectra obtained, only the carbons at the number 
4 position of the two rings appear to be plausible locations for modification. The 
multitude and TLC migrations of the yellow and orange products produced from the 
symmetrical P-carotene, however, also indicates that the enzymes of the invention cany 
out more than a single type of reaction. The apparent homology of the ketolases of the 

30 invention to the Arabidopsis P-carotene hydroxylase would suggest that compounds 
with a hydroxyl at the 3 and/or 4 positions of one or both rings are another possible 
outcome (see Figure 2). In fact, such compounds have been identified In Adonis (see 
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above), and it has long been conjectured that.a hydroxyl at position 4 is an intermediate 
in the formation of the 4-keto (e.g. crustaxanthin, a 3.3\4,4' tetrahydroxy carotenoid that 
might be a precursor for astaxanthin in the exoskeleton of the lobster). The histidine 
motifs and secondary structure in common to the hydroxylase and ketolase enzymes 
5 are characteristics of a large group of di-iron oxygenases whose members also include 
examples of desaturases (J. Shanklin, 1998, Ann. Rev, Plant Physiol. Plant MoL BioL), 
therefore a 3-4 desaturation (and/or perhaps a 2-3 desaturation in one or more of the 
yellow compounds) would also seem a plausible outcome. 

To summarize the results of this example for the Adonis ketolases of the 

10 invention, a number of different carotenoids, including two with ketocarotenoid-like 
spectra, are produced from p-carotene via the action of the products of either of the two 
different nucleic acids of the invention. These orange compounds appear to be the 
major products. Truncation and fusion of the cDNAs to a stronger promoter In the 
vector pTrcHls (Invitrogen) was detrimental to growth of E. coli but did result in 

15 Improved yield of the most polar orange product (orange band #2 in Figure 3). 
Introduction of a cyanobacterial ferredoxin did not change the yield or relative amounts 
of the various products. Without being bound by theory, it may be that the 
ketocarotenoids produced in flower petals of Adonis actually include the as yet 
unidentified orange compounds that are produced in E. coli using the nucleic acids of 

20 the Invention. 

EXAMPI F 9 
Substrate specificity of the Adonis ketnlaRf>Q 

Carotenoids with 8 rings are common In plants. The 8 ring differs from the b ring 
only in the position of the double bond within the ring (Figure 2). The e ring Is reported 

25 to be a poor substrate for the Arabidopsis b-carotene hydroxylase (Sun et a!., 1996). 
The Adonis ketolase cDNAs were introduced into strains of E. coli engineered 
(Cunningham et al., 1996) to accumulate carotenoids with one or two 8 rings (d- 
carotene and 8-carotene), or the acyclic carotenoid lycopene. TLC analysis of acetone 
extracts revealed that these carotenoids were not modified by the Adonis ketolases. 

30 as Indicated by a lack of any new products formed. Products produced In E. coli 
engineered to accumulate zeaxanthin (Sun et al., 1996) appeared to be the same as 
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for p-carotene accumulating cultures indicating that a 3-OH is likely to be one of the 
functional groups introduced to the b ring by the Adonis ketolases. The more polar 
orange band produced from b-carotene through the action of the Adonis ketolases (e.g., 
orange band 2 in Figure 3), therefore, could very well be 3.3'-dihydroxy-3,4,3',4'- 
5 tetradehydro-b.b-carotene. 

The references cited in the application, along with the following references, are 
incorporated by reference: 

Bouvier F, et al. (1998) Xanthophyll biosynthesis: molecular and functional 
characterization of carotenoid hydroxylases from pepper fruits (Capsicum annuum L.). 
1 0 Biochim Biophys Acta. 1 391 :320-8 

Breitenbach J, et al. (1996) Expression in Escherichia coli and properties of the 
carotene ketolase from Haematococcus pluvlalis. FEMS Microbiol Lett. 140:241-6 

Cunningham FX Jr. Gantt E (1998) Genes and enzymes of carotenoid biosynthesis in 
plants. Ann Rev Plant Physiol Plant Mol Biol 49: 657-583 

1 5 Fernandez-Gonzalez B, et al. (1 997) A new type of asymmetrically acting beta-carotene 
ketolase is required for the synthesis of echinenone in the cyanobacterium 
Synechocystis sp. PCC 6803. J Biol Chem. 272:9728-33 

Fraser PD, et al. (1997) In vitro characterization of astaxanthin biosynthetic enzymes. 
J Biol Chem. 1997272:6128-35 

20 Fraser PD, et al. (1998) Enzymic confirmation of reactions involved in routes to 
astaxanthin formation, elucidated using a direct substrate in vitro assay. Eur J Biochem. 
252:229-36 

Marker M, et al. (1997) Biosynthesis of ketocarotenoids in transgenic cyanobacteria 
expressing the algal gene for beta-C-4-oxygenase, crtO. FEBS Lett. 404:129-34 
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Kajiwara S, et al. (1995) Isolation and functional identification of a novel cDNA for 
astaxanthin biosynthesis from Haematococcus pluvialis, and astaxanthin synthesis in 
Escherichia coli. Plant Mol Biol. 29:343-52 

Lotan T, et al. (1995) Cloning and expression in Escherichia coli of the gene encoding 
5 beta-C-4-oxygenase, that converts beta-carotene to the ketocarotenoid canthaxanthin 
in Haematococcus pluvialis. FEBS Lett. 364:125-8 

Misawa N, et al. (1995) Canthaxanthin biosynthesis by the conversion of methylene to 
keto groups In a hydrocarbon beta-carotene by a single gene. Biochem Biophys Res 
1 0 Commun.209:867-76 

Misawa N, et al. (1995) Structure and functional analysis of a marine bacterial 
carotenoid biosynthesis gene cluster and astaxanthin biosynthetic pathway proposed 
at the gene level. J Bacteriol. 177;6575-84 

Miura Y, et al. (1998) Production of the carotenoids lycopene, beta-carotene, and 
15 astaxanthin in the food yeast Candida utilis. Appl Environ Microbiol. 64:1226-9 

Shanklin J, et al. (1997) Mossbauer studies of alkane omega-hydroxylase: evidence for 
a diiron cluster in an integral-membrane enzyme. Proc Natl Acad Sci USA. 94:2981-6 

Shanklin J, Cahoon EB (1998) Desaturation and related modifications of fatty acids. 
Ann Rev Plant Physiol Plant Mol Biol 49: 61 1-641 

20 Wang CW, et al. Engineered isoprenoid pathway enhances astaxanthin production in 
Escherichia coli. Biotechnol Bioeng. 1999 Jan 20;62(2):235-41. 
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claim: 



1 . A method of producing a ketocarotenoid in a host cell, the method comprising 
inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having ketolase enzyme activity, and has the 
nucleic acid sequence of SEQ ID NO: 1 or 3. wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing 
the ketocarotenoid. 

2. The method of claim 1. wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

3. A method of producing a ketocarotenoid in a host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having ketolase enzyme activity and has a 
sequence which encodes the amino acid sequence of SEQ ID NO: 2 or 4. wherein the 
heterologous nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing 
the ketocarotenoid. 

4. The method of claim 3. wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

5. A method of modifying the production of carotenoids in a host cell, relative to an 
untransformed host cell, the method comprising 

inserting into a host cell which already produces carotenoids a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein having 
ketolase enzyme activity and has the nucleic acid sequence of SEQ ID NO: 1 or 3. 
wherein the heterologous nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence In the host cell to 
modify the production of the carotenoids in the host cell, relative to an untransformed 
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host cell. 

6. The method of claim 5. wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

7. A method of modifying the production of carotenoids in a host cell, relative to an 
5 untransformed host cell, the method comprising 

inserting into a host cell which already produces carotenoids a vector 
comprising a heterologous nucleic acid isequence which encodes for a protein having 
ketolase enzyme activity and has a sequence which encodes the amino acid sequence 
of SEQ ID NO: 2 or 4. wherein the heterologous nucleic acid sequence is operably 
10 linked to a promoter; and 

expressing the heterologous nucleic acid sequence in the host cell to 
modify the production of the carotenoids in the host cell, relative to an untransformed 
host cell. 

8. The method of claim 7, wherein the host cell is selected from the group 
15 consisting of a bacterial cell, an algal cell and a plant cell. 

9. A purified nucleic acid sequence which encodes for a protein having ketolase 
enzyme activity and has the nucleic acid sequence of SEQ ID NO: 1 . 

10. A purified nucleic acid sequence which encodes for a protein having ketolase 
enzyme activity and has the nucleic acid sequence of SEQ ID NO: 3. 

20 11. A purified nucleic acid sequence which encodes for a protein having ketolase 
enzyme activity and has a sequence which encodes the amino acid sequence of SEQ 
ID NO: 2. 

12. A purified nucleic acid sequence which encodes for a protein having ketolase 
enzyme activity and has a sequence which encodes the amino acid sequence of SEQ 
25 ID NO: 4. 
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IS. A vector which comprises the nucleic acid sequence of any one of claims 9-12. 
wherein the nucleic acid sequence is operably linked to a promoter. 

14. A host cell which is transformed with the vector of claim 1 3. 

15. The host cell of claim 14, wherein the host cell Is selected from the group 
5 consisting of a bacterial cell, an algal cell and a plant cell. 

16. The host cell of claim 14, wherein the host cell Is a photosynthetic cell. 

17. The host cell of claim 14, wherein the host cell contains a ketocarotenoid. 

18. The host cell of claim 14, wherein the host cell contains modified levels of 
carotenoids, relative to an untransformed host cell. 

10 19. A purified ketolase enzyme which is encoded by the amino acid sequence of 
SEQ ID NO: 2. 

20. A purified ketolase enzyme which is encoded by the amino acid sequence of 
SEQ ID NO: 4. 
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Figure 5 [SEQ ID NO: 5] 

-23 ggg ctgcaggaat tcggcacgag 

1 agcaaucrca grgttcagta caagttattc tttccacaag aatctcttgt 

51 tgcac::caaa acaagacatt ctcaaccgcc catgtttgct cttctctcca 

101 gttgrggrgg agtcgcctat gagaaagaaa aagacacatc gtgctgcatg 

151 tatctgctct gtcgcagaga gaacaaggaa cctitgatatt: cctcaaatrg 

201 aagaagagga agagaacgag gaagaactaa tagaacagac ggattctggc 

251 acaatccaca eaaagaaaac gctagggggg aaacaatcaa gacggtccac 

301 tggcwcca-r gtcgcacccg tatcctgcct tgggatcctt tcaatgatcg 

351 gacccgcrg- rcactucaag ttttcacggc taatggagtg tggagatatt 

401 cctgtcgcag aaatggggat tacgtttgcc gcctttgttg ctgctgcgat 

451 tggcacggaa rctttgtcag gatgggttca caaagaactc tggcacgatt 

501 Cuttg^ggra cattcacaag tctcaccata ggtcacgaaa aggccgcttc 

551 gagtzcaarg atgtgtttgc tattattaac gcgcttcctg ctattgctct 

601 tatcaatcat: ggattctcaa atgaaggcct ccttcctgga gcctgctttg 

651 gtaccggrcr rggaacgaca gtctgtggca tggcttacat ttttcttcac 

701 aatggccrrr cacaccgaag gttcccagta gggcttattg caaacgtccc 

751 ttartzccaz aagctggctg cagctcacca aatccatcac tcaggaaaar 

801 ttcagggcgr accatttggc ctgttccttg gaccccagga attggaagaa 

851 gtaagagcag gcactgaaga attggagagg gtgatcagtc gtacagctaa 

901 acgaacgcaa ccatctacat gaatcaactc ttttacattt atgaggtttt 

951 agtttatcgg tgttacaagt cacacatttg tgtcgttgta gtaattcaaa 

1001 gttaccarac tcttttttag aatttttttt tgatgtatag gtcgcggagt 

1051 tacggrtaca aaggccaaat ctattgttgt ggaattccat tattaaaaat 

1101 aaaaatcaga -gtttgtagtt ttatctggtg atcaatatca atatatatta 

1151 attaaagcaa aaaaaaaaaa aaaaaa ctcgag 
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Figure 6 [SEQ ID NO: 6] 

MGLQEFGTR 

aisvfstsys fhknlllhsk qdilnrpcll fspvwespm rkkkthraac 
icsvaertrn Idipqieeee eneeelieqt dsgiihikkt Iggkqsrrst 
gsivapvscl cilsmigpav yfkfsrlmec gdipvaemgi tfaafvaaai 
gteflsgwvh kelwhdslwy ihkshhrsrk grfefndvfa iinalpaial 
inygfsnegl Ipgacfgtgl gttvcgmayi flhnglshrr fpvglianvp 
yfhklaaahq ihhsgkfqgv pfglflgpqe leevrggtee iervisrtak 
rtasst* 
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Figure 7 [SEQ ID NO: 7] 



-23 






ggg 


ctgcaggaat 


tcggcacgag 


1 


agcaatttca 


gtgttcagtt 


caggctattc 


tttctacaag 


aatctcttgt 


51 


tggactcaaa 


accaaatatt 


ctcaaacccc 


catgcctgct 


attctctcca 


101 


gttgtgatca 


tgtcgcctat: 


gagaaagaaa 


aagaaacatg 


gtgatccatg 


151 


tatctgctcc 


gttgcaggga 


gaacaaggaa 


ccttgatatt 


cctcaaattg 


201 


aagaagagga 


agagaatgtg 


gaagaactaa 


tagaacagac 


cgattctgac 


251 ^ 


atagtgcata 


taaagaaaac 


actagggggg 


aaacaatcaa 


aacggcccac 


301 


tggctccacr. 


cticgcacccg 


raccrigtct 


tgggatcctt 


tcaatgattg 


351 


gacctgccgt 


rcactticaag 


rrtzcacggc 


taatggaggg 


tggagarata 


401 


ccrgtagcag 


aaatggggat 


tacgcrcgcc 


acct wtgttg 


ctgctgcrgt 


451 


tggcacggag 


ttitttgtcag 


catgggttca 


caaagaactc 


tggcacgagt 


501 


ctttgtggta 


catccacaag 


tctcaccatc 


ggtcacgaaa 


aggccgcttc 


551 


gagttcaatg 


atgtgtttgc 


tarcaiztaac 


gcgcttcccg 


ctattgctct 


601 


tatcaattat 


ggattctcca 


acgaaggcct 


ccttcctgga 


gcgtgctirtg 


651 


gtgtcggtct 


cggaacaaca 


Gccrgtggta 


tggcttacat 


ttttctzcac 


701 


aatggccuat 


cacaccgaag 


gtccccagta 


tggcttattg 


cgaacgcccc 


751 


ttatttccac 


aagctggctg 


cagcrcacca 


aatacaccac 


tcaggaaaat 


801 


ttcagggrgt 


accatttggc 


ctgrrccttg 


gacccaagga 


attggaagaa 


851 


gtaagaggag 


gcactgaaga 


gtcggagagg 


gtaatcagtc 


gtacaactaa 


901 


acgaacgcaa 


ccatctacct 


gaaucaattt 


ttttacatat 


ataaggtittt 


951 


agtttatcgg 


tgttataaaa 


tcacacatcc 


gtatcgtttt: 


agtaagncaa 


1001 


agttaagata 


cttccttctt 


agaatatttt 


ttgatgtata 


ggtcgcggat 


1051 


atactgttac 


actattcgtt 


gtggaattcc 


attataaaaa 


aataaaaaaa 


1101 


aaaaaaaaaa 


aa ctcgag 
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Figure 8 [SEQ ID NO: 8] 



MGLQEFGTR 



aisvfssgys fyknilldsk pnilkppcll fspwimspm rkkkkhgdpc 
icsvagrtrn laipqieeee enveelieqt dsdivhikkt Iggkqskrpt 
gsivapvscl gilsmigpav yfkfsrlmeg gdipvaemgi tfatfvaaav 
gteflsawvh kelwheslwy ihkshhrsrk grfefndvfa iinalpaial 
inygfsnegl Ipgacfgvgl gttvcgmayi flhnglshrr fpvwlianvp 
yfhklaaahq ihhsgkfqgv pfglflgpke leevrggtee lervisrttk 
rtqpst* 
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Figure 9: Gap of SEQ ID NO: 9 and SEQ ID NO: 3 

1 agcaatctcagcgttcagtacaagttattctttccacaagaatctcttgt 50 

Mini IIMIIIIIIM II lllllllllli IIIIMIIIIIIIII 

1 agcaatttcagtgttcagttcaggttattctttctacaagaatctcttgt: 50 
51 tgcactcaaaacaagacattctcaaccgcccatgtttgctcttictctcca 100 

II lllllllll I I illltlll't llllll MM IIIIIIIM 

51 tggactcaaaaccaaatattctcaaacccccatgcctgctattctctcca 100 
101 gttgcggrggagccgcctatgagaaagaaaaagacacatcgtgctgcatg 150 

llllll I III MM I III MM MUM II MM III I MM 

101 gttgtgatcargtcgcctatgagaaagaaaaagaaacatggtgatccatg 150 
- 

151 t:atctgctct:gttgcagagagaacaaggaaccrt:gatat.tcct:caaatt:g 200 

IIMIItl! IIIMII IMIIiilllllllllliMIIIIIIIIIIII 

151 tatctgccccgttgcagggagaacaaggaaccttgatattcctcaaattg 200 
2 01 aagaagaggaagagaacgaggaagaactaatagaacagacggat'tctggc 250 

IIIIIIIIMIIIIII I IIIIIIIMIIillMIIIII IIIMII 1 

2 01 aagaagagcaagagaatgtggaagaactaatagaacagaccgattctgac 250 

• • • • . * 

251 ataattcacataaagaaaacgctaggggggaaacaatcaa;gacggtccac 300 

III I MM MUM MM IIIIIIIMMIIilllll MM MM 

251 atagtgcat:at:aaagaaaacact:aggggggaaacaat.caaaacggcccac 3 00 

- • • • 

301 tggctccactgtGgcacccgtatcttgtcttgggatcctttcaatgatcg 350 

llllillllllllllllMIIIIIIIIIMMIIIIMIIIIIIMll I 

301 tggctccatcgtcgcacccgtatcttgtcttgggatcctttcaatgattg 350 



BNSDOCID:<WO_...9S616S2Al I > 



SUBSTITUTE SHEET (RULE 26) 



wo 99/61652 ^ PCT/US99/104SS 

10/16 

Figure 9 (cont.) 

351 gacctgctgtctacttcaagttttcacggctaatggagtgtggagatatt 400 

MIIIIIMIIIIMIIMIMIIINIilllllllM illlllllll 

351 gacctgctgtctacttcaagtcttcacggctaatggagggtggagatata 400 
401 cctgtcgcagaaatggggattacgtttgccgcctttgttgctgctgcgat 450 

Mill llllllllilllilMillilMI MIMIIIIMIIMI I 

401 cctgtagcacaaatggggattacgtttgccacctttgttgctgctgctgt 450 
451 cggcacggaazttttgccaggatgggtwcacaaagaacrctggcacgatt, 500 

lllllllll IIMIillM IMIIIIIIIIIIMIIMililllll 1 

451 tggcacggagtttttgtcagcatgggttcacaaagaactctggcacgagt 500 

• • • * 

501 cuttgtggnacattcacaagtctcaccataggtcacgaaaaggccgcttc 550 

IIIIIIIIIilllllMIIIMIIIIIIl llllllilllllllllilll 

501 ccttgcggtacattcacaagtctcaccatcggtcacgaaaaggccgcttc 550 

, • • • * 

551 gagttcaatgatgtgtttgctattattaacgcgcttcctgctat'tgctct- €00 

lllillllMIIIIIIMMIIIMIIMMIIIIIII IIIIIMMM 

551 gagttcaatgacgtgtttgctattattaacgcgcttcccgctattgctct 600 
601 tatcaattatggattctcaaatgaaggccrccttcctggagcctgctttg 650 

Miliillllliilllll lllilllllllllllliillill IIIIIM 

601 tatcaattatggattctccaatgaaggcctccttcctggagcgtgctttg 650 
651 gtaccggtcttggaacgacagtctgtggcatggcttacatttttcttcac 700 

II lllllllllili lllliliiili lllililllMIIMNilll 

651 gtgtcggtcttggaacaacagtctgtggtatggcttacatttttcttcac 700 
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701 aatggcctttcacaccgaaggttcccagtagggcttattgcaaacgtccc 750 

1 1 i 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 M 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I i 1 1 1 1 

7 01 aatggcctatcacaccgaaggttcccagtatggcttattgcgaacgtccc 750 
751 ttatttccacaagctggctgcagctcaccaaaticcatcact caggaaaat 8 00 

lllllllllilillllMIIIMIMMIMM II I I I I 1 1 I I 1 1 1 1 I 

751 tt at ttccacaagctggcrgcagctcaccaaatacaccact caggaaaat 800 

8 01 tccagggtgraccatttggcccgtticctrggaccccaggaatuggaagaa 850 

IlilillliiillllllllllliMlllllMIII Mlilllillllll 

8 01 ttcagggtgtaccatttggcctgttccttggacccaaggaattggaagaa 850 
851 gtaagaggaggcactgaagaattggagagggtgat cagt cgtacagctaa 900 

IIIMIIIillllllillll IlillllllM 1 1 1 1 1 1 I i 1 1 I M 1 1 1 

851 gtaagaggaggcactgaagagttggagagggtaatcagtcgtacaactaa 900 
901 acgaacgcaatcatctacaTGAatcaactcttttacatttatgaggtttt 950 

iiiiiiiiii liiiiii iiiiiiii I iiiiiiii ill iiiiiii 

901 acgaacgcaaccatctaccTGAatcaattuttttacatatataaggtttt 950 
951 agtttatcggtgtta . caagtcacacatttgtgtcgttgtagtaattcaa 999 

lllllllllllllll ' II IIIIIIII M lllli llllll MM 

951 agtttatcggtcttataaaatcacacatccgcatcgttttagtaagtcaa 1000 

1000 agttaccatactcttttttagaatttttttttgatgtataggtcgcggag 1049 

Mill Mill III II IIIIIIIIIIIIIIIIIMMM 

1001 agttaagatacttccttcttagaatattttttgatgtataggtcgcggat 1050 



PCT/US99/10455 

11/16 



BNSOOCID: <WO_9861652A1J_> 



SUBSTITUTE SHEET (RULE 26) 



wo 99/61652 




PCTAJS99/10455 



12/16 

Figure 9 (cont.) 

1050 ttiacggcuacaaacgccaaatctiattgtitgtiggaattccattattaaaaa 1099 

Mi Mill I I IMMIMIMIIIIIM Mlli 

1051 atactgctac actattcgttgtggaattccattataaaaaa 1091 

1100 taaaaatt:agagcrtgtagtttt:at:ctggtgatcaatatcaatiat.atatt 1149 

MM i I 

1092 ataaaaaaaaaaaaaaaaaaa 
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Figure 10: Gap of SEQ ID NO: 2 and SEQ ID NO: 4 

1 AISVFSTSYSFHKNLLLHSKQDILNRPCLLFSPV\A^SPMRKKKTHRAAC 50 



51 ICSVAERTRNLJDIPQIEEEEENEEELIEQTDSGIIHIKKTLGGKQSRRST 100 

Mill I !! I I II I I I II I M I I .1 II I I I I I I : I I I I I I I I I I I : I I 
51 ICSVAGRTRNLDIPQIEESEENVEELISQTDSDIVHIKKTLGGKQSKRPT 100 

• • • • * 

101 GSIVAPVSZLGILSMIGPAVYFKFSRL.KECGDIPVAEMGITFAAFVAAAI 150 

I i 1 1 1 1 1 ! : I i I i i I 1 1 1 1 1 1 1 1 1 1 1 M ! 1 1 1 1 I i I I i I I I I I I I I h 

101 GSIVAPVSCLGILSMIGPAVYFKFSRLMEGGDIPVAEMGITFATFVAAAV 150 
151 GTEFLSGW\--JKEI.WHDSLWYIHKSHKRSRKGRFEFNDVFAIINALPAIAL 200 

I I III I lillMMHIillillllllinilllllllMlililllli 

151 GTEFLSAW\-^KELWKESLWyiHKSHHRSRKGRFEFNDVFAIINAIiPAIAIi 200 
2 01 INYGFSNEGLLPGACFGTGLGTTVCGMAYIFIjHNGLSHRRFPVGLIA 250 

Mini Mill Mini iiiininiiiniiiiiiniii nini 

201 INyGFSNE3LL.PGACFGVGLGTTVCGMAYIFLHNGLSHRRFPVWLIANVP 250 
251 YFKKLAAAKQIHKSGKFQGVPFGLFLGPQELEEVRGGTEELERVISRTAK 300 

llllilllMIIIIMIIIIIIIIIIII-lllllllllllllllllll I 

251 YFHKLAAAHQIHKSGKFQGVPFGLFLGPKELSEVRGGTEELERVISRTTK 300 

301 RTQSST* 307 • 

III 111 
301 RTQPST* 307 
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Figure 1 1 : Gap of SEQ ID NO: 2 and Arabidopsis p-carotene hydroxylase <SEQ 
ID NO: 10) 

1 AISVFSTSYSFHKNLLLHSKQDILNRPCLLFSPWVESPMRKKKTHRAAC 50 

■ I I • . I II . .II- I • 

1 MAAXLSTAVTFKP. . .LHRSFSSSSTDFRLRLPKSLSGFSPSLRFKRFSV 47 

51 ICSVAERTRNLDIPQIEEEEENEEELIEQTDSGIIHIKKTLGGKQSRRST 100 

i II -I I II : : : . I |.|H| 

4 8 CYWEERRQNSPIENDERPESTSSTKAIDAEYLALRIiAEKLERiaCSERST 97 

101 GSIVAPVSCLGILSMIGPAVYFKFSRLMECGDIPVAEMGITFAAFVAAAI 150 

n -I II II I II:: I I II hi • II ill I Ih 
98 YLIAAMliSSFGITSMAVMAVYYRFSWQMEGGEISMLEMFGTFALSVGAAV 147 

151 GTEFLSGWVHKELWHDSLWYIHKSKKRSRKGRFEFNDVFAIINALPAIAL 200 

I II • I h Mi III -Mih M II lllll|:|l III I 

148 GMEFWARWAHRALWHASLWNMHESrZ-IKPRSGPFELNDVFAIWAGPM 197 

201 INYGFSNSGLLPGACFGTGLGTTVCGMAYIFLHNGLSHRRFPVGLIANVP 250 

:-lll MM! Ill III II l-li-l-i-ll hill I I I hi I 
198 LSYGFFNKGLVPGLCFGAGLGITVFGIAYMFVHDGLVHKRFPVGPIADVP 247 

251 YFHKLAAAHQIHHSGKFQGVPFGLFLGPQELEEVRGGTEELERVISRTAK 300 

I hllllhlh M lihillllhlllll II III:: IN i 
248 YLRKVAAAHQLHHTDKFNGVPYGLFLGPKELEEV.-GGNEELDKEISiUlIK 296 

301 RTQSST* 307 

297 SYKKASGSGSSSSS* 311 
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Figure 12A (SEQ ID NO: 11) 

1 CATACCATAA ATAGTAGAGG ACAACCTACA AACCAACCAC CAGAAACZTC 50 
51 CAATGGCAGC 



Figure 12B (SEQ ID NO: 12) 

maaaisvfssgys~yknllldskpnilk??cllf3?vvims?mrkk:^^ 

trnld i ?q i ezeeenvzeli eqtdsd i vhi :<:<ri.ggkqs krptgs i vapvs clgi ismig 

PAVYFKJSPiMEGGDIPVAEMGITFATF^/AAAVGTErLSAWVEKZLWKH 
SRXGRJEFNDVF AI INALPAIAIiINYGFSNZGLI.?GAC?GVGLGTTVCGMAYI FLHNGLS 
HRRFF'^rwI.I ANV? YFKKLAAAKQ IHKSGKFQGV? FGLFLG? KSLZEVRGGTSELERVI SR 
TTKRTQPST* 
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Figure 13 



Atl 

At2 

Cal 

Ca2 

ildKl 

AdK6 



* 20 * 40 * 60 

MAAXLSTAVTFKPLHRSFSSSSTDFRLRLPKslsgf spslR fkrgsvcg 

MAAGLSTlAVTLKPI*NRSSFSANHPl3tavfppalRFNGFRR rlti Jtvcfc 

MAAEISISASSRAICLQRNPFPAPKYFATAPpllf f 3pltCNIJ3AII*RSRR3cpr JaacF 

TTGRYHYQLVWCQISFSSTSRTSyYRHSPFLGPKPTPTTPSVYpitpf spxilGSII*RCRR rpsgtvc 



- AX SVFSTSBFHKNI.LlHSKQDllJmpll 
-KAAAISVTSSGgFYKKLLlDSKPljzJCPpgl 



ipBi 



've : 


52 


■ve : 


53 


l)c : 


62 


le £ 


71 


'ae : 


56 


«g : 


59 



FSP 



C V 



80 * 100 

Atl : errqNSPXEITDERPESTSSTNAIDAEYLAL 

At2 : erkqSSPMDDDN^ESTTSSSEZLHTS 

Cal £ ddklYTAQSGKQSDTEAZGDEIEVETNEEKSIJl' 

Ca2 : ddJcfKTQFEAGEEBXSMKZSSQISAT 

AdKl : rSrateaofeEEEENTrKTrffljEOTDSGII 

AdK6 : rflriiSl^mQiEEBEENVEESiEQTDSDZV 




Atl 

At2 

Cal 

Ca2 

AdKl 

AdK6 




180 
svg-aavgje fj 
sVgaawg"j.ef] 
fg-aaigge; 
Jsvg - aavg»5e £| 
tf aaSva-aaigSef 
tf atjva-aavgfef 
XFA V Aa G B£ 




llwhas 1 
IwhdalwnM 
Iwhaalwbt; 
Iwliaa Iwj 
Iwhdal 
'iwhasl 
1MB SZ.W 




* 220 

felzidvfaivnagpai : 195 

f elndvf aitnavpai : 194 

felndlfaiinavpal : 209 

f elndvfailnavpai : 210 

f efndvfaixnalpai : 198 

f efndvfailzialpax : 201 
FE NDvFAX HA PAZ 



Atl : gllsygffx 

At2 : gllyyg£lx 

Cal : a££sfg£z 

Ca2 : alldygffl 

AdKl : alinygfsz 

AdK6 : alinygfsz 
1 yGF 




jlipg^ etc 
jllpgj cfgtgls 
jllpgjcfgvglgj 



Itv "giayfflf , 

Itv Jgmaygf] 
jtv i gmaygf I 
|tv| gmayaf 
Itvggaayjf; 




♦ 280 

irfpvgS iadvpylrkSaaalK 
f pvg{! ianvpylrJcflaaaiit 
- £pvgg lakvpyfgrflaaahu 
IrfpvgE vanvpylrkSaaalis 
r £p vgji azxvpy £li}c9a a ali< 
{r £pvwgianvpy £hkjaaah( 



GJ* P6 CFG GL6 Tv GmAY F H 61. H RFPVg xA VPY k AAAHq 




300 • 
Atl : gl£lgpkeleevgg-neel< 
At2 : glflgpkqeveevgGkeal 
Cal : glflgpkeleevgv-ieel 
Ca2 : glflgpkeleevgg-leel 
AdKl : gl£lgpqeleevj|gGteel 
AdK6 s gl£lgpkeleev^Gteel 
GZiFLGPkalaBv g 




iksykkaSGSGSSSSS : 310 

klynkgSSTS : 305 

likslkrl : 315 

tryikga : 316 

^qasT : 306 

JqpsT : 309 

k 



a3cfiBq 
ltk3<3 
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SEQUENCE LISTING 



<110>. CUNNINGHT^, Francis -X. 

<120> CAROTENOID KETOLASE GENES AND GENE PRODUCTS, PRODUCTION 
OF KETOCAROTENOIDS AND METHODS OF MODIFYING CAROTENOIDS 
USING THE GENES 

<130> 8172-9022 

<140> Unknown 
<141> 1999-05-21 

<150> 60/086,460 
<151> 1998-05-22 

<160> 18 

<170> PatentIn Ver . 2.0 

<210> 1 
<211> 1176 
<212> DNA 

<213> Adonis aestivalis 



agcaatctca gtgttcagta caagttattc tttccacaag aatctcttgt tgcactcaaa 60 

acaagacatt ctcaaccgcc catgtttgct cttctctcca gttgtggtgg agtcgcctat 120 

gagaaagaaa aagacacatc gtgctgcatg tatctgctct gttgcagaga gaacaaggaa 180 

ccttgatatt cctcaaattg aagaagagga agagaacgag gaagaactaa tagaacagac 240 

ggattctggc ataattcara taaagaaaac gctagggggg aaacaatcaa gacggtccac 300 

tggctccatt gtcgcacccg tatcttgtct tgggatcctt tcaatgatcg gacctgctgt 360 

ttacttcaag ttttcacgcc taatggagtg tggagatatt cctgtcgcag aaatggggat 420 

tacgtttgcc gcctttgtug ctgctgcgat tggcacggaa tttttgtcag gatgggttca 480 

caaagaactc tggcacgatt ctttgtggta cattcacaag tctcaccata ggtcacgaaa 540 

aggccgcttc gagttcaarg atgtgtttgc tattattaac gcgcttcctg ctattgctct 600 

tatcaattat ggattctcaa atgaaggcct ccttcctgga gcctgctttg gtaccggtct 660 

tggaacgaca gtctgtggca tggcttacat ttttcttcac aatggccttt cacaccgaag 720 

gttcccagta gggcttattg caaacgtccc ttatttccac aagctggctg cagctcacca 780 

aatccatcac tcaggaaaat ttcagggtgt accatttggc ctgttccttg gaccccagga 840 

attggaagaa gtaagaggag gcactgaaga attggagagg gtgatcagtc gtacagctaa 900 

acgaacgcaa tcatctacat gaatcaactc ttttacattt atgaggtttt agtttatcgg 960 

tgttacaagt cacacatttg tgtcgttgta gtaattcaaa gttaccatac tcttttttag 1020 

aatttttttt tgatgtatag gtcgcggagt tacggttaca aaggccaaat ctattgttgt 1080 

ggaattccat tattaaaaat aaaaattaga gtttgtagtt ttatctggtg atcaatatca 1140 

atatatatta attaaagcaa aaaaaaaaaa aaaaaa 1176 



<400> 



1 



<210> 2 



1 
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<211> 306 
<212> PRT 

<213> Adonis aestivalis 



<400> 2 

Ala lie Ser Val Phe 
1 5 

Leu His Ser Lys Gin 
20 

Pro Val Val Val Glu 
35 

Ala Cys lie Cys Ser 
50 

Gin He Glu Glu Glu 
65 

Asp Ser Gly He He 
85 

Arg Arg Ser Thr Gly 
100 

Leu Ser Met He Gly 
115 

Glu Cys Gly Asp He 
130 

Phe Val Ala Ala Ala 
145 

Lys Glu Leu Trp His 
165 

Arg Ser Arg Lys Gly 
180 

Asn Ala Leu Pro Ala 
195 

Gly Leu Leu Pro Gly 
210 

Cys Gly Met Ala Tyr 



Ser Thr Ser Tyr Ser Phe 
10 

Asp He Leu Asn Arg Pro 
25 

Ser Pro Met Arg Lys Lys 
40 

Val Ala Glu Arg Thr Arg 
55 

Glu Glu Asn Glu Glu Glu 
70 75 

His He Lys Lys Thr Leu 
90 

Ser He Val Ala Pro Val 
105 

Pro Ala Val Tyr Phe Lys 
120 

Pro Val Ala Glu Met Gly 
135 

He Gly Thr Glu Phe Leu 
150 155 

Asp Ser Leu Trp Tyr He 
170 

Arg Phe Glu Phe Asn Asp 
185 

He Ala Leu He Asn Tyr 
200 

Ala Cys Phe Gly Thr Gly 
215 

He Phe Leu His Asn Gly 



His Lys Asn Leu Leu 
15 

Cys Leu Leu Phe Ser 
30 

Lys Thr His Arg Ala 
45 

Asn Leu Asp He Pro 
60 

Leu He Glu Gin Thr 

80 

Gly Gly Lys Gin Ser 
95 

Ser Cys Leu Gly He 
110 

Phe Ser Arg Leu Met 
125 

He Thr Phe Ala Ala 
140 

Ser Gly Trp Val His 
160 

His Lys Ser His His 
175 

Val Phe Ala He He 
190 

Gly Phe Ser Asn Glu 
205 

Leu Gly Thr Thr Val 
220 

Leu Ser His Arg Arg 



2 
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225 230 235 



24 0 



Phe Pro Val Gly Leu lie Ala Asn Val Pro. Tyr Phe His Lys Leu Ala 
245 250 255 

Ala Ala His Gin He His His Ser Gly Lys Phe Gin Gly Val Pro Phe 
260 265 270 

Gly Leu Phe Leu Gly Pro Gin Glu Leu Glu Glu Val Arg Gly Gly Thr 
275 280 285 

Glu Glu Leu Glu Arg Val He Ser Arg Thr Ala Lys Arg Thr Gin Ser 
290 295 300 

Ser Thr 
305 



<210> 3 
<211> 1112 
<212> DNA 

<213> Adonis aestivalis 
<400> 3 

agcaatttca gtgttcagtt caggttattc 
accaaatatt ctcaaacccc catgcctgct 
gagaaagaaa aagaaacatg gtgatccatg 
ccttgatatt cctcaaattg aagaagagga 
cgattctgac atagtgcata taaagaaaac 
tggctccatt gtcgcacccg tatcttgtct 
ttacttcaag ttttcacggc taatggaggg 
tacgtttgcc acctttgttg ctgctgctgt 
caaagaactc tggcacgagt ctttgtggta 
aggccgcttc gagttcaatg atgtgtttgc 
tatcaattat ggattctcca atgaaggcct 
tggaacaaca gtctgtggta tggcttacat 
gttcccagta tggcttattg cgaacgtccc 
aatacaccac tcaggaaaat ttcagggtgt 
attggaagaa gtaagaggag gcactgaaga 
acgaacgcaa ccatctacct gaatcaattt 
tgttataaaa tcacacatcc gtatcgtttt 
agaatatttt ttgatgtata ggtcgcggat 
attataaaaa aataaaaaaa aaaaaaaaaa 

<210> 4 
<211> 306 
<212> PRT 

<213> Adonis aestivalis 



tttctacaag aatctcttgt tggactcaaa 60 
attctctcca gttgtgatca tgtcgcctat 120 
tatctgctcc gttgcaggga gaacaaggaa 180 
agagaatgtg gaagaactaa tagaacagac 240 
actagggggg aaacaatcaa aacggcccac 300 
tgggatcctt tcaatgattg gacctgctgt 360 
tggagatata cctgtagcag aaatggggat 420 
tggcacggag tttttgtcag catgggttca 480 
cattcacaag tctcaccatc ggtcacgaaa 540 
tattattaac gcgcttcccg ctattgctct 600 
ccttcctgga gcgtgctttg gtgtcggtct 660 
ttttcttcac aatggcctat cacaccgaag 720 
ttatttccac aagctggctg cagctcacca 780 
accatttggc ctgttccttg gacccaagga 840 
gttggagagg gtaatcagtc gtacaactaa 900 
ttttacatat ataaggtttt agtttatcgg 960 
agtaagtcaa agttaagata cttccttctt 1020 
atactgttac actattcgtt gtggaattcc 1080 

1112 
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<400> 4 

Ala lie Ser Val 
1 

Leu Asp Ser Lys 
20 

Pro Val Val . lie 
35 

Pro Cys lie Cys 
50 

Gin lie Glu Glu 
65 

Asp Ser Asp lie 



Lys Arg Pro Thr 
100 

Leu Ser Met lie 
115 

Glu Gly Gly Asp 
130 

Phe Val Ala Ala 
145 

Lys Glu Leu Trp 



Arg Ser Arg Lys 
180 

Asn Ala Leu Pro 
195 

Gly Leu Leu Pro 
210 

Cys Gly Met Ala 
225 

Phe Pro Val Trp 



Phe Ser Ser Gly 

"5 

Pro Asn lie Leu 



Met Ser Pro Met 
40 

Ser Val Ala Gly 
55 

Glu Glu Glu Asn 
70 

Val Kis lie Lys 
85 

Gly Ser He Val 



Gly Pro Ala Val 
120 

He Pro Val Ala 
135 

Ala Val Gly Thr 
150 

His Glu Ser Leu 
165 

Gly Arg Phe Glu 



Ala He Ala Leu 
200 

Gly Ala Cys Phe 
215 

Tyr He Phe Leu 
230 

Leu He Ala Asn 



Tyr Ser Phe Tyr 
10 

Lys Pro Pro Cys 
25 

Arg Lys Lys Lys 



Arg Thr Arg Asn 
60 

Val Glu Glu Leu 
75 

Lys Thr Leu Gly 
90 

Ala Pro Val Ser 
105 

Tyr Phe Lys Phe 



Glu Met Gly He 
140 

Glu Phe Leu Ser 
155 

Trp Tyr He His 
170 

Phe Asn Asp Val 
185 

He Asn Tyr Gly 



Gly Val Gly Leu 
220 

His Asn Gly Leu 
235 

Val Pro Tyr Phe 



Lys Asn Leu Leu 
15 

Leu Leu Phe Ser 
30 

Lys His Gly Asp 
45 

Leu Asp He Pro 



He Glu Gin Thr 
80 

Gly Lys Gin Ser 
95 

Cys Leu Gly He 
110 

Ser Arg Leu Met 
125 

Thr Phe Ala Thr 



Ala Trp Val His 
160 

Lys Ser His His 
175 

Phe Ala He He 
190 

Phe Ser Asn Glu 
205 

Gly Thr Thr Val 



Ser His Arg Arg 
240 

His Lys Leu Ala 
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24 5 250 . 255" " 

Ala Ala His Gin He His His Ser Gly Lys Phe Gin Gly Val Pro Phe 
260 265 270 

Gly Leu Phe Leu Gly Pro Lys Glu Leu Glu Glu Val Arg Gly Gly Thr 
275 280 285 

Glu Glu Leu Glu Arg Val He Ser Arg Thr Thr Lys Arg Thr Gin Pro 
290 295 300 

Ser Thr 
305 



<210> 5 
<211> 1205 
<212> DNA 

<213> Adonis aestivalis 
<400> 5 

gggctgcagg aattcggcac gagagcaatc tcagtgttca gtacaagtta ttctttccac 60 

aagaatctct tgttgcactc aaaacaagac attctcaacc gcccatgttt gctcttctct 120 

ccagttgtgg tggagtcgcc tatgagaaag aaaaagacac atcgtgctgc atgtatctgc 180 

tctgttgcag agagaacaag gaaccttgat attcctcaaa ttgaagaaga ggaagagaac 240 

gaggaagaac taatagaacs gacggattct ggcataattc atataaagaa aacgctaggg 300 

gggaaacaat caagacggtc cactggctcc attgtcgcac ccgtatcttg tcttgggatc 360 

ctttcaatga tcggacctcc tgtttacttc aagttttcac ggctaatgga gtgtggagat 420 

attcctgtcg cagaaatccc gattacgttt gccgcctttg ttgctgctgc gattggcacg 480 

gaatttttgt caggatggct tcacaaagaa ctctggcacg attctttgtg gtacattcac 540 

aagtctcacc ataggtcacg aaaaggccgc ttcgagttca atgatgtgtt tgctattatt 600 

aacgcgcttc ctgctattcc tcttatcaat tatggattct caaatgaagg cctccttcct 660 

ggagcctgct ttggtacccg tcttggaacg acagtctgtg gcatggctta catttttctt 720 

cacaatggcc tttcacaccg aaggttccca gtagggctta ttgcaaacgt cccttatttc 780 

cacaagctgg ctgcagctca ccaaatccat cactcaggaa aatttcaggg tgtaccattt 840 

ggcctgttcc ttggacccca ggaattggaa gaagtaagag gaggcactga agaattggag 900 

agggtgatca gtcgtacacc taaacgaacg caatcatcta catgaatcaa ctcttttaca 960 

tttatgaggt tttagttta- cggtgttaca agtcacacat ttgtgtcgtt gtagtaattc 1020 

aaagttacca tactcttttt tagaattttt ttttgatgta taggtcgcgg agttacggtt 1080 

acaaaggcca aatctattcr tgtggaattc cattattaaa aataaaaatt agagtttgta 1140 

gttttatctg gtgatcaa-a tcaatatata ttaattaaag caaaaaaaaa aaaaaaaaac 1200 
tcgag 3^205 

<210> 6 
<211> 315 
<212> PRT 

<213> Adonis aestivalis 



5 
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<4 00> 6 

Met Gly Leu Gin Gl\j Phe Gly Thr Arg Ala lie Ser Val Phe Ser Thr 
^ 5 10 15 

Ser Tyr Ser Phe His Lys Asn Leu Leu Leu His Ser Lys Gin Asp He 
20 25 30 

Leu Asn Arg Pro Cys Leu Leu Phe Ser Pro Val Val Val Glu Ser Pro 
35 40 45 

Met Arg Lys Lys Lys Thr His Arg Ala Ala Cys lie Cys* Ser Val Ala 
50 55 60 

Glu Arg Thr Arg Asn Leu Asp He Pro Gin He Glu <31u Glu Glu Glu 
6S 70 75 80 

Asn Glu Glu Glu Leu He Glu Gin Thr Asp Ser Gly He He His He 
65 90 95 

Lys Lys Thr Leu Gly Gly Lys Gin Ser Arg Arg Ser Thr Gly Ser He 
100 105 110 

Val Ala Pro Val Ser Cys Leu Gly He Leu Ser Met He Gly Pro Ala 
115 120 125 

Val Tyr Phe Lys Phe Ser Arg Leu Met Glu Cys Gly Asp He Pro Val 
130 135 140 

Ala Glu Met Gly He Thr Phe Ala Ala Phe Val Ala Ala Ala He Gly 
145 150 155 160 

Thr Glu Phe Leu Ser Gly Trp Val His Lys Glu Leu Trp His Asp Ser 
165 170 175 

Leu Trp Tyr He His Lys Ser His His Arg Ser Arg Lys Gly Arg Phe 
180 185 190 

Glu Phe Asn Asp Val Phe Ala He He Asn Ala Leu Pro Ala He Ala 
195 200 205 

Leu He Asn Tyr Gly Phe Ser Asn Glu Gly Leu Leu Pro Gly Ala Cys 
210 215 220 

Phe Gly Thr Gly Leu Gly Thr Thr Val Cys Gly Met Ala Tyr He Phe 
225 230 235 240 

Leu His Asn Gly Leu Ser His Arg Arg Phe Pro Val Gly Leu He Ala 
245 250 255 



wo 99/61652 PCTAJS99/1 0455 

Asn Val Pro Tyr Phe His Lys Leu Ala Ala Ala His Gin He His His 
260 265 270 

Ser Gly Lys Phe Gin Gly Val Pro Phe Gly Leu Phe Leu Gly Pro Gin 
275 280 285 

Glu Leu Glu Glu Val Arg Gly Gly Thr Glu Glu Leu Glu Arg Val He 
290 295 300 

Ser Arg Thr Ala Lys Arg Thr Gin Ser Ser Thr ' 
305 310 315 



<210> 7 
<211> 1141 
<212> DNA 

<213> Adonis aestivalis 



<400> 7 

gggctgcagg aattcggcac gagagcaatt tcagtgttca gttcaggtta ttctttctac 60 

aagaatctct tgttggactc aaaaccaaat attctcaaac ccccatgcct gctattctct 120 

ccagttgtga tcatgtcgcc tatgagaaag aaaaagaaac atggtgatcc atgtatctgc 180 

tccgttgcag ggagaacaag gaaccttgat attcctcaaa ttgaagaaga ggaagagaat 240 

gtggaagaac taatagaaca gaccgattct gacatagtgc atataaagaa aacactaggg 300 

gggaaacaat caaaacggcc cactggctcc attgtcgcac ccgtatcttg tcttgggatc 360 

ctttcaatga ttggacctgc tgtttacttc aagttttcac ggctaatgga gggtggagat 420 

atacctgtag cagaaatggg gattacgttt gccacctttg ttgctgctgc tgttggcacg 480 

gagtttttgt cagcatgggt tcacaaagaa ctctggcacg agtctttgtg gtacattcac 540 

aagtctcacc atcggtcacg aaaaggccgc ttcgagttca atgatgtgtt tgctattatt 600 

aacgcgcttc ccgctattgc tcttatcaat tatggattct ccaatgaagg cctccttcct 660 

ggagcgtgct ttggtgtcgg tcttggaaca acagtctgtg gtatggctta catttttctt 720 

cacaatggcc tatcacaccc aaggttccca gtatggctta ttgcgaacgt cccttatttc 780 

cacaagctgg ctgcagctca ccaaatacac cactcaggaa aatttcaggg tgtaccattt 840 

ggcctgttcc ttggacccaa ggaattggaa gaagtaagag gaggcactga agagttggag 900 

agggtaatca gtcgtacaac taaacgaacg caaccatcta cctgaatcaa tttttttaca 960 

tatataaggt tttagtttat cggtgttata aaatcacaca tccgtatcgt tttagtaagt 1020 

caaagttaag atacttcctt cttagaatat tttttgatgt ataggtcgcg gatatactgt 1080 

tacactattc gttgtggaat tccattataa aaaaataaaa aaaaaaaaaa aaaaactcga 1140 
9 1141 



<210> 8 
<211> 315 
<212> PRT 

<213> Adonis aestivalis 
<400> 8 

Met Gly Leu Gin Glu Phe Gly Thr Arg Ala He Ser Val Phe Ser Ser 

7 
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1 5 10 



15 



Gly Tyr Ser Phe Tyr Lys Asn Leu Leu Leu Asp Ser Lys Pro Asn lie 
20 25 30 

Leu Lys Pro Pro Cys Leu Leu Phe Ser Pro Val Val lie Met Ser Pro 
35 40 45 



Met Arg Lys Lys Lys 
50 

Gly Arg Thr Arg Asn 
65 

Asn Val Glu Glu Leu 
85 



Lys His Gly Asp Pro Cys 
55 

Leu Asp lie Pro Gin lie 
70 75 

He Glu Gin Thr Asp Ser 
90 



He Cys Ser Val Ala 
60 

Glu Glu Glu Glu Glu 

80 

Asp He Val His He 
95 



Lys Lys Thr Leu 
100 

Val Ala Pro Val 
115 

Val Tyr Phe Lys 
130 

Ala Glu Met Gly 
145 

Thr Glu Phe Leu 



Leu Trp Tyr He 
180 

Glu Phe Asn Asp 
195 

Leu He Asn Tyr 
210 

Phe Gly Val Gly 
225 

Leu His Asn Gly 



Asn Val Pro Tyr 



Gly Gly Lys Gin 



Ser Cys Leu Gly 
120 

Phe Ser Arg Leu 
135 

He Thr Phe Ala 
150 

Ser Ala Trp Val 
165 

His Lys Ser His 



Val Phe Ala He 
200 

Gly Phe Ser Asn 
215 

Leu Gly Thr Thr 
230 

Leu Ser His Arg 
245 

Phe His Lys Leu 



Ser Lys Arg Pro 
105 

He Leu Ser Met 



Met Glu Gly Gly 
140 

Thr Phe Val Ala 
155 

His Lys Glu Leu 
170 

His Arg Ser Arg 
185 

He Asn Ala Leu 



Glu Gly Leu Leu 
220 

Val Cys Gly Met 
235 

Arg Phe Pro Val 
250 

Ala Ala Ala His 



Thr Gly Ser He 
110 

He Gly Pro Ala 
125 

Asp He Pro Val 



Ala Ala Val Gly 
160 

Trp His Glu Ser 
175 

Lys Gly Arg Phe 
190 

Pro Ala He Ala 
205 

Pro Gly Ala Cys 



Ala Tyr He Phe 
240 

Trp Leu He Ala 
255 

Gin He His His 
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2 60 265 270 

Ser Gly Lys Phe Gin Gly Val Pro Phe Gly Leu Phe Leu Gly Pro Lys 
275 280 285 

Glu Leu Glu Glu Val Arg Gly Gly Thr Glu Glu . Leu Glu Arg Val lie 
.290 295 300 

Ser Arg Thr Thr Lys Arg Thr Gin Pro Ser Thr 
305 310 315 



<210> 9 
<211> 11^39 
<212> DNA • 

<213> Adonis aestivalis 

<400> 9 

agcaatctca gtgttcag-- caagttattc tttccacaag aatctcttgt tgcactcaaa 60 
acaagacatt ctcaaccccc catgtttgct cttctctcca gttgtggtgg agtcgcctat 120. 
gagaaagaaa aagacaca-c gtgctgcatg tatctgctct gttgcagaga gaacaaggaa 180 
ccttgatatt cctcaaatrg aagaagagga agagaacgag gaagaactaa tagaacagac 240 
ggattctggc ataattcata taaagaaaac gctagggggg aaacaatcaa gacggtccac 300 
tggctccatt gtcgcacccc tatcttgtct tgggatcctt tcaatgatcg gacctgctgt 360 
ttacttcaag ttttcacggc taatggagtg tggagatatt cctgtcgcag aaatggggat 420 
tacgtttgcc gcctttgttg ctgctgcgat tggcacggaa tttttgtcag gatgggttca 480 
caaagaactc tggcacgarr ctttgtggta cattcacaag tctcaccata ggtcacgaaa 54 0 
aggccgcttc gagttcaaTic atgtgtttgc tattattaac gcgcttcctg ctattgctct 600 ' 
tatcaattat ggattctcaa atgaaggcct ccttcctgga gcctgctttg gtaccggtct 660 
tggaacgaca gtctgtggca tggcttacat ttttcttcac aatggccttt cacaccgaag 720 
gttcccagta gggcttatrg caaacgtccc ttatttccac aagctggctg cagctcacca 780 
aatccatcac tcaggaaaa- ttcagggtgt accatttggc ctgttccttg gaccccagga 840 
attggaagaa gtaagaggag gcactgaaga attggagagg gtgatcagtc gtacagctaa 900 
acgaacgcaa tcatctacat gaatcaactc ttttacattt atgaggtttt agtttatcgg 960 
tgttacaagt cacacattrg tgtcgttgta gtaattcaaa gttaccatac tcttttttag 1020 
aatttttttt tgatgtatag gtcgcggagt tacggttaca aaggccaaat ctattgttgt 1080 
ggaattccat tattaaaaa- aaaaattaga gtttgtagtt ttatctggtg atcaatatca 1140 
atatatatt ^^^^^ 

<210> 10 

<211> 310 

<212> PRT 

<213> Arabidopsis 

<400> 10 

Met Ala Ala Xaa Leu Ser Thr Ala Val Thr Phe Lys Pro Leu His Arg 
1 5 10 15 



wo 99/61652 



PCTAJS99/10455 



Ser Phe Ser Ser Ser Ser Thr Asp Phe Arg Leu Arg Leu Pro Lys "Ser 
20 25 30 

Leu Ser Gly Phe Ser Pro Ser Leu Arg Phe Lys Arg Phe Ser Val Cys 
35 40 45 

Tyr Val Val Glu Glu Arg Arg Gin Asn Ser Pro He Glu Asn Asp Glu 
SO 55 60 

Arg Pro Glu Ser Thr Ser Ser Thr Asn Ala 11^ Asp Ala Glu Tyr Leu 
^S 70 75 80 

Ala Leu Arg Leu Ala Glu Lys Leu Glu Arg Lys Lys Ser Glu Arg Ser 
85 90 95 

Thr Tyr Leu He Ala Ala Met Leu Ser Ser Phe Gly He Thr Ser Met 
100 105 110 

Ala Val Met Ala Val Tyr Tyr Arg Phe Ser Trp Gin Met Glu Gly Gly 
lis 120 125 

Glu lie Ser Met Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly 
130 135 140 

Ala Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu 
145 150 155 160 

Trp His Ala Ser Leu Trp Asn Met His Glu Ser His His Lys Pro Arg 
ISS 170 175 

Glu Gly Pro Phe Giu Leu Asn Asp Val Phe Ala He Val Asn Ala Gly 
180 185 190 

Pro Ala He Gly Leu Leu Ser Tyr Gly Phe Phe Asn Lys Gly Leu Val 
1S5 200 205 

Pro Gly Leu Cys Phe Gly Ala Gly Leu Gly He Thr Val Phe Gly He 
210 215 220 

Ala Tyr Mez Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val 
225 230 235 240 

Gly Pro He Ala Asp Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His 
245 250 255 

Gin Leu His His Thr Asp Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe 
260 265 270 
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Leu Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Asn Glu Glu Leu Asp 
275 280 285 

Lys Glu lie Ser Arg Arg lie Lys Ser Tyr Lys Lys Ala Ser Gly Ser 
290 295 300 

Gly Ser Ser Ser Ser Ser 
305 310 



<210> 11 
<211> 60 
<212> DNA 

<213> Adonis aestivalis 
<400> 11 

cataccataa atagtagagg acaacctaca aaccaaccac cagaaacctc caatggcagc 60 

<210> 12 
<211> 309 
<212> PRT 

<213> Adonis aestivalis 
<400> 12 

Met Ala Ala Ala lie Ser Val Phe Ser Ser Gly Tyr Ser Phe Tyr Lys 
15 10 15 

Asn Leu Leu Leu Asp Ser Lys Pro Asn lie Leu Lys Pro Pro Cys Leu 
20 25 30 

Leu Phe Ser Pro Val Val lie Met Ser Pro Met Arg Lys Lys Lys Lys 
35 40 45 

His Gly Asp Pro Cys lie Cys Ser Val Ala Gly Arg Thr Arg Asn Leu 
50 55 60 

Asp lie Pro Gin lie Glu Glu Glu Glu Glu Asn Val Glu Glu Leu lie 
65 70 75 80 

Glu Gin Thr Asp Ser Asp lie Val His lie Lys Lys Thr Leu Gly Gly 
85 90 95 

Lys Gin Ser Lys Arg Pro Thr Gly Ser lie Val Ala Pro Val Ser Cys 
100 105 110 

Leu Gly lie Leu Ser Met lie Gly Pro Ala Val Tyr Phe Lys Phe Ser 
115 120 125 
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Arg Leu Met Glu Gly 
130 

Phe Ala Thr Phe Val 
145 

Trp Val His Lys Glu 
165 

Ser His His Arg Ser 
180 

Ala He He Asn Ala 
195 

Ser Asn Glu Gly Leu 
210 

Thr Thr Val Cys Gly 
225 

His Arg Arg Phe Pro 
245 

Lys Leu Ala Ala Ala 
260 

Val Pro Phe Gly Leu 
275 

Gly Gly Thr Glu Glu 
290 



Gly Asp He Pro Val Ala 
135 

Ala Ala Ala Val Gly Thr 

150 155 

Leu Trp His Glu Ser Leu 
170 

Arg Lys Gly Arg Phe Glu 
185 

Leu Pro Ala He Ala Leu 
200 

Leu Pro Gly Ala Cys Phe 
215 

Met Ala Tyr He Phe Leu 
230 235 

Val Trp Leu He Ala Asn 
250 

His Gin He His His Ser 
265 

Phe Leu Gly Pro Lys Glu 
280 

Leu Glu Arg Val He Ser 
295 



Glu Met Gly He Thr 
140 

Glu Phe Leu Ser Ala 
160 

Trp Tyr He His Lys 
175 

■Phe Asn Asp Val Phe 
190 

He Asn Tyr Gly Phe 
205 

Gly Val Gly Leu Gly 
220 

His Asn Gly Leu Ser 
240 

Val Pro Tyr Phe His 
255 

Gly Lys Phe Gin Gly 
270 

Leu Glu Glu Val Arg 
285 

Arg Thr Thr Lys Arg 
300 



Thr Gin Pro Ser Thr 
305 



<210> 13 

<211> 310 
<212> PRT 
<213> Arabidopsis 

<400> 13 

Met Ala Ala Xaa Leu Ser Thr Ala 
1 5 

Ser Phe Ser Ser Ser Ser Thr Asp 
20 

12 



Val Thr Phe Lys Pro Leu His Arg 
10 15 

Phe Arg Leu Arg Leu Pro Lys Ser 
25 30 
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Leu Ser Gly Phe Ser Pro Ser Leu Arg Phe Lys Arg Phe Ser Val Cys- 
35 40 45 

Tyr Val Val Glu Glu Arg Arg Gin Asn Ser Pro lie Glu Asn Asp Glu 
50 55 -60 

Arg Pro Glu Ser Thr Ser Ser Thr Asn Ala He Asp Ala Glu Tyr Leu 

"70 75 80 

Ala Leu Arg Leu Ala Glu Lys Leu Glu Arg Lys Lys Ser Glu Arg Ser 
85 90 95 

Thr Tyr Leu He Ala Ala Met Leu Ser Ser Phe Gly He Thr Ser Met 
100 105 110 

Ala Val Met Ala Val Tyr Tyr Arg Phe Ser Trp Gin Met Glu Gly Gly 
11- 120 125 

Glu He Ser Met Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly 
130 135 140 

Ala Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu 

ISO 155 160 

Trp His Ala Ser Leu Trp Asn Met His Glu Ser His His Lys Pro Arg 
165 170 175 

Glu Gly Pro Phe Glu Leu Asn Asp Val Phe Ala He Val Asn Ala Gly 
180 185 190 

Pro Ala He Gly Leu Leu Ser Tyr Gly Phe Phe Asn Lys Gly Leu Val 
195 200 205 

Pro Gly Leu Cys Phe Gly Ala Gly Leu Gly He Thr Val Phe Gly He 
210 215 220 

Ala Tyr Met Phe Val Kis Asp Gly Leu Val His Lys Arg Phe Pro Val 
225 230 235 240 

Gly Pro He Ala Asp Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His 
245 250 255 

Gin Leu His His Thr Asp Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe 
260 265 270 

Leu Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Asn Glu Glu Leu Asp 
275 280 285 
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Lys Glu He Ser Arg Arg He Lys Ser Tyr Lys Lys Ala Ser Gly Ser 
290 295 300 

Gly Ser Ser Ser Ser Ser 
305 310 



<210> 14 

<211> 305 

<212> PRT 

<213> Arabidopsis 

<400> 14 

Met Ala Ala Gly Leu Ser Thr He Ala Val Thr Leu Lys Pro Leu Asn 
15 10 15 

Arg Ser Ser Phe Ser Ala Asn His Pro He Ser Thr Ala Val Phe Pro 
20 25 30 

Pro Ser Leu Arg Phe Asn Gly Phe Arg Arg Arg Lys He Leu Thr Val 
35 40 45 

Cys Phe Val Val Glu Glu Arg Lys Gin Ser Ser Pro Met Asp Asp Asp 
50 55 60 

Asn Lys Pro Glu Ser Thr Thr Ser Ser Ser Glu He Leu Met Thr Ser 
65 70 75 80 

Arg Leu Leu Lys Lys Ala Glu Lys Lys Lys Ser Glu Arg Phe Thr Tyr 
85 90 * 95 

Leu He Ala Ala Val Met Ser Ser Phe Gly He Thr Ser Met Ala He 
100 105 110 

Met Ala Val Tyr Tyr Arg Phe Ser Trp Gin Met Lys Gly Gly Glu Val 
115 120 125 

Ser Val Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly Ala Ala 
130 135 140 

Val Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu Trp 
1^^ 150 155 160 

His Asp Ser Leu Trp Asn Met His Glu Ser His His Lys Pro Arg -Glu 
165 170 175 

Gly Ala Phe Glu Leu Asn Asp Val Phe Ala He Thr Asn Ala Val Pro 

14 
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180 185 190 

Ala He Gly Leu Leu Tyr Tyr Gly Phe Leu. Asn Lys Gly Leu Val Pro 
195 200 205 



Gly Leu Cys Phe 
210 

Tyr Met Phe Val 
225 

Pro He Ala Asn 



Leu His His Thr 
260 

Gly Pro Lys Gin 
275 

Lys Glu He Ser 
290 



Gly Ala Gly Leu 
215 

His Asp Gly Leu 
230 

Val Pro Tyr Leu 
245 

Asp Lys Phe Lys 



Glu Val Glu Glu 
280 

Arg Arg He Lys 
295 



Gly He Thr Met 
220 

Val His Lys Arg 
235 

Arg Lys Val Ala 
250 

Gly Val Pro Tyr 
265 

Val Gly Gly Lys 



Leu Tyr Asn Lys 
300 



Phe Gly Met Ala 



Phe Pro Val Gly 
240 

Ala Ala His Gin 
255 

Gly Leu Phe Leu 
270 

Glu Glu Leu Glu 
285 

Gly Ser Ser Thr 



Ser 
305 



<210> 15 
<211> 315 
<212> PRT 

<213> Capsicum annuum 
<400> 15 

Met Ala Ala Glu He Ser He Ser Ala Ser Ser Arg Ala He Cys Leu 
15 10 15 

Gin Arg Asn Pro Phe Pro Ala Pro Lys Tyr Phe Ala Thr Ala Pro Pro 
20 25 30 

Leu Leu Phe Phe Ser Pro Leu Thr Cys Asn Leu Asp Ala He Leu Arg 
35 40 45 

Ser Arg Arg Lys Pro Arg Leu Ala Ala Cys Phe Val Leu Lys Asp Asp 
50 55 60 

Lys Leu Tyr Thr Ala Gin Ser Gly Lys Gin Ser Asp Thr Glu Ala He 
65 70 75 80 
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Gly Asp Glu lie Glu Val Glu Thr Asn Glu Glu Lys Ser Leu Ala Val 
85 . 90 95 ' 

Arg Leu Ala Glu Lys Phe Ala Arg Lys Lys Ser Glu Arg Phe Thr Tyr 
100 105 110 

Leu Val Ala Ala Val Met Ser Ser Leu Gly He Thr Ser Met Ala Val 
115 120 125 

He Ser Val Tyr Tyr Arg Phe Ser Trp Gin Met Glu Gly Gly Glu Met 
130 135 140 

Pro Phe Ser Glu Met Phe Cys Thr Phe Ala Leu Ala Phe Gly Ala Ala 

150 155 160 

He Gly Met Glu Tyr Trp Ala Arg Trp Ala His Arg Ala Leu Trp His 
165 170 175 

Ala Ser Leu Trp His Met His Glu Ser His His Arg Pro Arg Glu Gly 
180 185 190 

Pro Phe Glu Leu Asn Asp He Phe Ala He He Asn Ala Val Pro Ala 
195 200 205 

He Ala Phe Phe Ser Phe Gly Phe Asn His Lys Gly Leu. He Pro Gly 
210 215 220 

He Cys Phe Gly Ala Gly Leu Gly He Thr Val Phe Gly Met Ala Tyr 

230 235 240 

Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val Gly Pro 
245 250 255 

He Ala Lys Val Pro Tyr Phe Gin Arg Val Ala Ala Ala His Gin Leu 
260 265 270 

His His Ser Asp Lys Phe Asp Gly Val Pro Tyr Gly Leu Phe Leu Gly 
275 280 285 

Pro Lys Glu Leu Glu Glu Val Gly Val He Glu Glu Leu Glu Lys Glu 
290 295 300 

Val Asn Arg Arg He Lys Ser Leu Lys Arg Leu 
305 310 315 



<210> 16 
<211> 316 
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<212> PRT 

<213> Capsicum annuum 
■ ■<400> 16 

Thr Thr Gly Arg Tyr His Tyr Gin Leu Val Trp Cys Gin He Ser Phe 
1 5 10 15 

Ser Ser Thr Ser Arg Thr Ser Tyr Tyr Arg His Ser Pro Phe Leu Gly 
20 25 30 

Pro Lys Pro Thr Pro Thr Thr Pro Ser Val Tyr Pro He Thr Pro Phe 
35 40 45 

Ser Pro Asn Leu Gly Ser He Leu Arg Cys Arg Arg Arg Pro Ser Phe 
50 55 60 

Thr Val Cys Phe Val Leu Glu Asp Asp Lys, Phe Lys Thr Gin Phe Glu 
"65 70 75 80 

Ala Gly Glu Glu Asp He Glu Met Lys He Glu Glu Gin He Ser Ala 
85 90 95 

Thr Arg Leu Ala Glu Lys Leu Ala Arg Lys Lys Ser Glu Arg Phe Thr 
100 105 - 110 

Tyr Leu Val Ala Ala Val Met Ser Ser Phe Gly He Thr Ser Met Ala 
115 120 125 

Val Met Ala Val Tyr Tyr Arg Phe Tyr Trp Gin Met Glu Gly Gly Glu 
130 135 140 

Val Pro Phe Ser Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly Ala 
145 150 155 160 

Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Lys Ala Leu Trp 
165 170 175 

His Ala Ser Leu Trp His Met His Glu Ser His His Lys Pro Arg Glu 
180 185 190 

Gly Pro Phe Glu Leu Asn Asp Val Phe Ala He He Asn Ala Val Pro 
195 200 205 

Ala He Ala Leu Leu Asp Tyr Gly Phe Phe His Lys Gly Leu He Pro 
210 215 220 

Gly Leu Cys Phe Gly Ala Gly Leu Gly He Thr Val Phe Gly Met Ala 
225 230 235 240 
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Tyr Met Phe Val 



Pro Val Ala Asn 
260 

Leu His His Ser 
275 

Gly Pro Lys Glu 
290 

Glu Val Asn Arg 
305 



His Asp Gly Leu 
245 

Val Pro Tyr Leu 



Glu Lys Phe Asn 
280 

Leu Glu Glu Val 
295 

Arg Thr Arg Tyr 
310 



Val His Lys Arg 
250 

Arg Lys Val Ala 
265 

Gly Val Pro Tyr 



Gly Gly Leu Glu 
300 

lie Lys Gly Ser 
315 



Phe Pro Val Gly 
255 

Ala Ala His Ser 
270 

Gly Leu Phe Leu 
285 

Glu Leu Glu Lys 



<210> 17 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 17 

cagaatcggt ctgttctatt agttcttcc 

<210> 18 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 18 

caatttgagg aatatcaagg ttccttgttc tc 



18 



# 



INTERNATIONAL SEARCH REPORT 



IntcrDauooal applicatioo No. 
PCT/US99/I0455 



A. CLASSIFICATION OF SUBJECT MATTER 
IPC(6) :C12P 23/00. 7/26; CI2N 9/02. 1/20. J3/00; C07H 21/04; C07K 14/00 
US CL :435/67, 148. 189. 2523. 252J3, 320.1; 536/23.2, 23.6; 530/350 

According to Intcmabonal Patent Classification (IPC) or to both national classification and IPC 



R FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 435/67. 148, 189, 252J. 252J3, 320.1; 536/23.2, 23.6; 530/350 



Documcoution searched other than minimum documenution to the extent that such documenu arc included in the fields scatvbed 



Electronic dau base consulted during the international search (name of dau base and. where practicable, search terms used) 
Please See Extra Sheet 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indicau'on. where appropriate, of the relevant passages 



Relevant to claim No. 



A.E 
Y.P 



US 5,453,565 A (MAWSON) 26 September 1995, see abstract and 
claims. 

US 5,910.433 A (KAJIWARA et al.) 08 June 1999, see the entire 
patent. 

US 5,811.273 A (MISAWA et al.) 22 September 1998, See abstract, 
column 30 - lines 48-58 and claims. 



1-20 



1-20 



1-20 



Further documents are listed in the continuation of Box C. ( | See patent family annex. 



L* 



SpMial catceorica of ciud 4ocumBntt: 

documam dcrantng ih« general tuto of ih« an which m « 
lo b* of paniculsT reUvancc 

•vImt document published on or efur ihe tnierneuonal filtn| 4eu 

<locument which mey throw doubu on priority claim<»> or which ae 
csted to esublish the pubiicetion dau of another ciiatkm or other 
special reason (as epeciTicd) 

dooment refeirins to an oral disclosure, use. eahibilion or other 

doeumem published prior u> the international filins date but later than 
_ the priority dale cieimed 



leier ciocumem published erier the inumeiional rUing date or p s iwily 
dale and not in connict with the epplicauoD but ctted lo < ' 
the prmciple or theory underlying the tnveoiion 



ncnt of panicular rslevancc; the cUimed invemioo 

considered novel or cannot be considered to involve an inventree i 
when the document n taken alone 

docvanent of particular relevance; the cieimed tnvaatic» conet 
ID invohre ao inventive eiep whan the deouBOBi 



to a peraoB skilled in the an 
of the same patent family 



Date of the actual completion of the international search 
13 AUGUST 1999 



Date of mailing of the international search report 



29 OCT 1999 



Name and mailing address of the ISAAJS 
Commissioner of Paienis and Trademarks 
Box PCT 

Washington, D.C. 2023 1 
Facsimile No. (703) 305-3230 



Foim PCT/ISA/210 (second sheetXJuly 1992)* 



Authorize^'ii^icef _ /' >^ 

TEl^CHANb SAIDHA 5^^^ 
Telephone No. 0703) 308-0196 





lJ>rrERNATIONAL SEARCH REPORT 



In icroational application No. 
PCT/US99/r0455 



B. FIELDS SEARCHED 

Electronic data bases consulted (Name of data base and where practicable terms used): 

APS, STN Files: Medline, Caplus, Biosis. Agricola. Embase & Sciscarch. Search terms used : beu carotene and 
keiolasc. ketocarolenoid. Adonis aestivalis, carotenoid biosynthesis, gene? or dna or ma or nucleic acid? in various 
permutations and combinations. 



Form PCT/ISA/2 10 (extra shcetXJuly 1992)* 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of the original 
documents submitted by the apphcant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SffiES 

□ FADED TEXT OR DRAWING 



□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 




LURRED OR ILLEGIBLE TEXT OR DRAWING 



